fix(tui): allow agent starts to join parallel dispatch batches (#3801) by Hmbown · Pull Request #3812 · Hmbown/CodeWhale

Hmbown · 2026-06-29T23:54:19Z

Closes #3801 (child of #3800).

Problem

Multiple agent tool calls in one parent turn were serialized as non-parallel tools because the agent tool spec did not override supports_parallel() (defaulting to false). In high fanout (e.g. 20 concurrent agent launches), launch latency scaled linearly — each agent resolved its model route under the global tool-execution write lock, making the TUI feel frozen.

Change

Three input-aware overrides on AgentTool's ToolSpec impl:

starts_detached_for(action=start) = true — action=start launches a background agent and returns immediately. This lets agent launches join parallel batches via the detached_start path in plan_tool_execution_batches (which requires detached_start && !approval_required && !interactive). In auto-approved modes (YOLO), approval_required is already false, so the path fires.
supports_parallel_for(status|peek) = true and is_read_only_for(status|peek) = true — read-only queries of manager state can safely run in parallel batches.
approval_requirement_for(status|peek) = Auto — status/peek are read-only queries that don't need user approval.

Security / policy guardrails (per the issue)

No bypass of approval requirements: action=start and action=cancel still return ApprovalRequirement::Required. In non-YOLO modes, each agent launch still prompts for approval and is thus serial (expected). The parallel path only fires when auto_approve = true.
No bypass of concurrency limits: the manager's launch_gate semaphore, max_depth, and configured concurrency caps are unchanged. The global tool lock just doesn't serialize the launch portion.
Preserves one tool_result per tool_use: the existing dispatch/planner/executor pipeline handles this unchanged.
Does NOT mark arbitrary tools parallel-safe: only the agent tool opts in via its own ToolSpec impl.

Tests

agent_start_detached_plans_join_single_parallel_batch: 4 agent starts form 1 parallel batch, not 4 serial batches.
agent_start_detached_plans_batch_with_readonly_tools: mixed grep + agent start + grep form 1 parallel batch.

claude · 2026-06-29T23:54:36Z

Claude finished @Hmbown's task in 1m 44s —— View job

Code Review

Gather context and read changed files
Review crates/tui/src/tools/subagent/mod.rs
Review crates/tui/src/core/engine/tests.rs
Post findings

Summary

Clean, well-scoped change. The _for(input) override pattern is the right hook — it keeps ToolSpec callers in control and doesn't require touching the batch planner itself. Error handling is correct: all parse_agent_tool_action failures fall through to safe, conservative defaults (false / Required). The two new tests exercise the exact batch-planner code path being fixed.

Three things worth addressing:

Finding 1 — Misleading doc comment on `is_read_only_for` (minor / docs)

File: crates/tui/src/tools/subagent/mod.rs:3563

/// #3801: status/peek/cancel actions are read-only queries of manager state.
fn is_read_only_for(&self, input: &Value) -> bool {
    matches!(
        parse_agent_tool_action(input),
        Ok(AgentToolAction::Status) | Ok(AgentToolAction::Peek)
    )
}

The comment says cancel is read-only, but Cancel is not in the match arm and is definitely not read-only — it cancels a running agent (mutating manager state). The implementation is correct; the comment needs the word "cancel" removed.

Fix this →

Finding 2 — Missing negative test for non-YOLO mode (test coverage gap)

The PR's own security note says: "In non-YOLO modes, each agent launch still prompts for approval and is thus serial (expected). The parallel path only fires when auto_approve = true."

The two new tests only cover the auto-approved path (all plans have approval_required = false). There's no test asserting the inverse: that a detached-start agent plan with approval_required = true does not join a parallel batch. This is the most important guardrail to verify with a test, since tool_plan_can_join_parallel_batch checks !plan.approval_required and a regression there would silently re-serialize the fix.

Suggested test (mirrors the existing pattern):

#[test]
fn agent_start_requires_approval_does_not_join_parallel_batch() {
    let mut agent_start = make_plan_at(0, false, false, false, true); // approval_required=true
    agent_start.name = "agent".to_string();
    agent_start.detached_start = true;

    let batches = plan_tool_execution_batches(vec![agent_start]);
    assert_eq!(batches.len(), 1);
    assert!(matches!(&batches[0], ToolExecutionBatch::Serial(_)),
        "agent start requiring approval must not join parallel batch");
}

Fix this →

Finding 3 — No coverage for `action=cancel` parallelism isolation (minor)

The PR guarantees cancel stays serial (it returns Required and is not detached). There's no test asserting this. Given the security framing of this PR, a regression test guarding that action=cancel never joins a parallel batch would make the invariant machine-checkable. Low priority, but fits the pattern of the other negative tests in this file.

What's good

starts_detached_for correctly returns true only for Start, not for Cancel. The Cancel action's approval_required=true would block it from a parallel batch anyway, but it's good to be conservative.
approval_requirement_for falls to Required for any parse error or unknown action — no accidental auto-approval.
Tests use make_plan_at directly rather than instantiating a real AgentTool, keeping them fast and focused on the batch planner, which is the correct seam to test here.
The detached_start path in tool_plan_can_join_parallel_batch is unchanged — this PR's behaviour is purely additive, driven by the new ToolSpec impl.

Agent tool now reports starts_detached_for(action=start) = true, enabling independent agent launches to join parallel batches via the detached-start path in plan_tool_execution_batches. In auto-approved modes (YOLO), multiple agent calls in one model turn no longer serialize under the global tool-exec write lock — they dispatch concurrently up to configured concurrency limits. Also: - is_read_only_for returns true for status/peek (read-only queries) - supports_parallel_for returns true for status/peek - approval_requirement_for returns Auto for status/peek (no approval needed) Tests: agent_start_detached_plans_join_single_parallel_batch, agent_start_detached_plans_batch_with_readonly_tools. Closes #3801 (child of #3800).

claude · 2026-06-30T00:04:09Z

Claude finished @Hmbown's task in 2m 51s —— View job

Code Review

Load tools
Read changed files and diff
Analyze correctness, security, and test coverage
Post findings

Summary

The batch-planner hook design is clean and the _for(input) override pattern is exactly right — input-aware routing without touching the planner itself. The three existing findings from the prior review pass remain unaddressed; one new correctness concern emerges from reading inspect_agent_from_input.

Finding 1 — `is_read_only_for` contract is wrong: Status/Peek mutate manager state (correctness)

File: crates/tui/src/tools/subagent/mod.rs:3561

is_read_only_for returns true for Status and Peek, but both code paths in inspect_agent_from_input unconditionally call:

let mut manager = manager.write().await;
manager.cleanup(COMPLETED_AGENT_RETENTION);

cleanup() takes a mutable reference and removes/archives completed-agent records — this is a real write to shared state. Consequences:

Two "parallel" status calls in the same batch will contend on the write lock and serialize anyway, so the parallel-batch label buys nothing for them.
A status call in the same batch as an agent start means cleanup fires concurrently with spawn (whichever gets the write lock first). Today this is safe because cleanup only removes COMPLETED_AGENT_RETENTION-old agents. But the is_read_only contract implies this path can never race with a write — a future caller relying on that assumption would be wrong.

Either inspect_agent_from_input should use manager.read() for the inspection portion and decouple cleanup into a separate path (making the is_read_only claim true), or is_read_only_for for Status/Peek should return false (and these actions enter the batch via detached_start or a new non-read-only parallel path instead).

Fix this →

Finding 2 — Misleading doc comment on `is_read_only_for` (docs, prior review Finding 1)

File: crates/tui/src/tools/subagent/mod.rs:3560

/// #3801: status/peek/cancel actions are read-only queries of manager state.
fn is_read_only_for(&self, input: &Value) -> bool {

cancel appears in the comment but not in the match arm — Cancel mutates manager state and is correctly excluded from the match. Remove "cancel" from the comment.

Fix this →

Finding 3 — No negative test: approval_required=true must not join parallel batch (test coverage, prior review Finding 2)

This is the most important invariant to machine-check: the security note in the PR says "the parallel path only fires when auto_approve = true." There's no test asserting that detached_start=true, approval_required=true produces a Serial batch.

#[test]
fn agent_start_requires_approval_does_not_join_parallel_batch() {
    let mut plan = make_plan_at(0, false, false, true, false); // approval_required=true
    plan.name = "agent".to_string();
    plan.detached_start = true;

    let batches = plan_tool_execution_batches(vec![plan]);
    assert_eq!(batches.len(), 1);
    assert!(
        matches!(&batches[0], ToolExecutionBatch::Serial(_)),
        "agent start requiring approval must not join parallel batch"
    );
}

A regression in tool_plan_can_join_parallel_batch's !plan.approval_required guard would be silently invisible without this test.

Fix this →

Finding 4 — No negative test: cancel must not join parallel batch (test coverage, prior review Finding 3)

No test guards the invariant that action=cancel is never batched in parallel. Since cancel returns Required approval and isn't detached, it can't currently join a parallel batch — but a test makes this machine-checkable and prevents silent regression.

What's good

Error fallthrough in all four overrides is conservative (false / Required) — no accidental auto-approval or parallel-marking on parse failure.
starts_detached_for correctly returns true only for Start, not Cancel. Cancel is also blocked by approval_required=true even if detached_start were accidentally set.
The two new positive tests cover the exact dispatch-planner code path and use make_plan_at directly — fast and focused on the right seam.
No changes to the batch planner itself; this is purely additive.

Hmbown force-pushed the codex/0866-3801-agent-parallel-launch branch from 9b90118 to 2d9fce9 Compare June 30, 2026 00:03

Hmbown merged commit fd173b6 into main Jun 30, 2026
18 checks passed

Hmbown deleted the codex/0866-3801-agent-parallel-launch branch June 30, 2026 01:16

Hmbown mentioned this pull request Jun 30, 2026

v0.8.66: Release gate for multi sub-agent fanout freeze #3800

Closed

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(tui): allow agent starts to join parallel dispatch batches (#3801)#3812

fix(tui): allow agent starts to join parallel dispatch batches (#3801)#3812
Hmbown merged 1 commit into
mainfrom
codex/0866-3801-agent-parallel-launch

Hmbown commented Jun 29, 2026

Uh oh!

claude Bot commented Jun 29, 2026 •

edited

Loading

Uh oh!

claude Bot commented Jun 30, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Hmbown commented Jun 29, 2026

Problem

Change

Security / policy guardrails (per the issue)

Tests

Uh oh!

claude Bot commented Jun 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review

Summary

Finding 1 — Misleading doc comment on is_read_only_for (minor / docs)

Finding 2 — Missing negative test for non-YOLO mode (test coverage gap)

Finding 3 — No coverage for action=cancel parallelism isolation (minor)

What's good

Uh oh!

claude Bot commented Jun 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review

Summary

Finding 1 — is_read_only_for contract is wrong: Status/Peek mutate manager state (correctness)

Finding 2 — Misleading doc comment on is_read_only_for (docs, prior review Finding 1)

Finding 3 — No negative test: approval_required=true must not join parallel batch (test coverage, prior review Finding 2)

Finding 4 — No negative test: cancel must not join parallel batch (test coverage, prior review Finding 3)

What's good

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

claude Bot commented Jun 29, 2026 •

edited

Loading

Finding 1 — Misleading doc comment on `is_read_only_for` (minor / docs)

Finding 3 — No coverage for `action=cancel` parallelism isolation (minor)

claude Bot commented Jun 30, 2026 •

edited

Loading

Finding 1 — `is_read_only_for` contract is wrong: Status/Peek mutate manager state (correctness)

Finding 2 — Misleading doc comment on `is_read_only_for` (docs, prior review Finding 1)